OUTLIER DETECTION ON HIGH DIMENSIONAL DATA USING MINIMUM VECTOR VARIANCE (MVV)
Abstract
High-dimensional data can occur in actual cases where the number of variables p is larger than the number of observations n. A problem that often occurs when dimensions are added is that the data points come to resemble outliers. Outliers are the part of the observations that do not follow the distribution pattern of the data and are located far from its center. The existence of outliers needs to be detected because it can lead to deviations in the analysis results. One of the methods used to detect outliers is the Mahalanobis distance. To obtain a robust Mahalanobis distance, the Minimum Vector Variance (MVV) method is used. This study compares the MVV method with the classical Mahalanobis distance method in detecting outliers in non-invasive blood glucose level data, both at p>n and n>p. The test results show that the MVV method performs better than the classical method and is more effective at identifying the minimum data group and the outliers.
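The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that the vector variance of a covariance matrix S is Tr(S^2), and that a robust center and covariance can be approximated by searching for the h-subset with the smallest vector variance, using random starts and concentration steps. The function names, the default subset size h, and the use of a pseudo-inverse (so that the distance stays computable when p>n) are all assumptions made for this illustration.

```python
import numpy as np

def mahalanobis_sq(X, center, cov):
    """Squared Mahalanobis distance of each row of X from `center`."""
    diff = X - center
    # Assumption: a pseudo-inverse is used so a singular covariance
    # matrix (for example when p > n) does not raise an error.
    inv = np.linalg.pinv(np.atleast_2d(cov))
    return np.einsum("ij,jk,ik->i", diff, inv, diff)

def vector_variance(cov):
    """Vector variance Tr(S^2), i.e. the sum of squared entries of S."""
    return float(np.sum(np.asarray(cov) ** 2))

def mvv_estimate(X, h=None, n_starts=20, max_iter=50, seed=0):
    """Simplified MVV-style search (illustrative, not the published algorithm).

    Keeps the h-subset with the smallest vector variance found over
    several random starts and returns its mean and covariance.
    """
    n, p = X.shape
    if h is None:
        h = (n + p + 1) // 2          # hypothetical default subset size
    h = min(h, n)
    rng = np.random.default_rng(seed)
    best = None
    for _ in range(n_starts):
        idx = rng.choice(n, size=h, replace=False)
        for _ in range(max_iter):
            center = X[idx].mean(axis=0)
            cov = np.cov(X[idx], rowvar=False)
            # Concentration step: keep the h points closest to the current fit.
            new_idx = np.argsort(mahalanobis_sq(X, center, cov))[:h]
            if set(new_idx.tolist()) == set(idx.tolist()):
                break
            idx = new_idx
        center = X[idx].mean(axis=0)
        cov = np.cov(X[idx], rowvar=False)
        vv = vector_variance(cov)
        if best is None or vv < best[0]:
            best = (vv, center, cov)
    return best[1], best[2]

def classical_distances(X):
    """Classical squared distances from the full-sample mean and covariance."""
    return mahalanobis_sq(X, X.mean(axis=0), np.cov(X, rowvar=False))

def robust_distances(X, **kwargs):
    """Squared distances from the MVV-style center and covariance."""
    center, cov = mvv_estimate(X, **kwargs)
    return mahalanobis_sq(X, center, cov)
```

Observations whose robust distance exceeds a chosen cut-off would be flagged as outliers. The cut-off is left to the reader here, since the abstract does not state which one the study uses.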
Similar resources
Outlier Detection on High Dimensional Data Using RNN
Background: Outlier detection is an important factor in data mining since it is used in various real-time applications. An outlier is an extreme point that is not related to any class. Dealing with dimensions is the great challenge, due to the “curse of dimensionality”, for effective outlier detection. In a high dimensional data space, it is difficult to detect most related points and most un...
Outlier Detection for Support Vector Machine using Minimum Covariance Determinant Estimator
The purpose of this paper is to identify the points that affect the performance of one of the important algorithms of data mining, namely the support vector machine. The final classification decision is made based on a small portion of the data called support vectors. So, the existence of atypical observations among these points will result in deviation from the correct decision. Thus...
Outlier detection for high dimensional data pdf
Is particularly useful for high dimensional data where outliers cannot be found. High dimensional data in Euclidean space pose special challenges to data. In about just the last few years, the task of unsupervised outlier detection has found. Outlier detection is an outstanding data mining task referred to...
Outlier detection for high-dimensional data
Outlier detection is an integral component of statistical modelling and estimation. For high-dimensional data, classical methods based on the Mahalanobis distance are usually not applicable. We propose an outlier detection procedure that replaces the classical minimum covariance determinant estimator with a high-breakdown minimum diagonal product estimator. The cut-off value is obtained from the...
Hybrid Approach for Outlier Detection in High Dimensional Data
It has been observed recently that the prominence of multidimensional data is increasing. Existing outlier detection techniques generally fail to work on multidimensional data. The need for analyzing high dimensional data has thus increased with today’s data trends. It has enormous application in the medical domain, network intrusion and satellite imagery. Even though there are existing methodologie...
Journal
Journal title: Barekeng
Year: 2022
ISSN: 1978-7227, 2615-3017
DOI: https://doi.org/10.30598/barekengvol16iss3pp797-804